Metropolis-Hastings Algorithms for Estimating Betweenness Centrality in Large Networks

نویسندگان

  • Mostafa Haghir Chehreghani
  • Talel Abdessalem
  • Albert Bifet
چکیده

Betweenness centrality is an important index widely used in different domains such as social networks, traffic networks and the world wide web. However, even for mid-size networks that have only a few hundreds thousands vertices, it is computationally expensive to compute exact betweenness scores. Therefore in recent years, several approximate algorithms have been developed. In this paper, first given a network G and a vertex r ∈ V (G), we propose a MetropolisHastings MCMC algorithm that samples from the space V (G) and estimates betweenness score of r. The stationary distribution of our MCMC sampler is the optimal sampling proposed for betweenness centrality estimation. We show that our MCMC sampler provides an (ǫ, δ)approximation, where the number of required samples depends on the position of r in G and in many cases, it is a constant. Then, given a network G and a set R ⊂ V (G), we present a Metropolis-Hastings MCMC sampler that samples from the joint space R and V (G) and estimates relative betweenness scores of the vertices in R. We show that for any pair ri, rj ∈ R, the ratio of the expected values of the estimated relative betweenness scores of ri and rj respect to each other is equal to the ratio of their betweenness scores. We also show that our joint-space MCMC sampler provides an (ǫ, δ)-approximation of the relative betweenness score of ri respect to rj , where the number of required samples depends on the position of rj in G and in many cases, it is a constant. 1998 ACM Subject Classification G.2.2 [Discrete Mathematics] Graph Theory, Graph algorithms

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Better Approximation of Betweenness Centrality

Estimating the importance or centrality of the nodes in large networks has recently attracted increased interest. Betweenness is one of the most important centrality indices, which basically counts the number of shortest paths going through a node. Betweenness has been used in diverse applications, e.g., social network analysis or route planning. Since exact computation is prohibitive for large...

متن کامل

On Local and Global Centrality in Large Scale Networks

Estimating influential nodes in large scale networks including but not limited to social networks, biological networks, communication networks, emerging smart grids etc. is a topic of fundamental interest. To understand influences of nodes in a network, a classical metric is centrality within which there are multiple specific instances including degree centrality, closeness centrality, betweenn...

متن کامل

Path Centrality: A New Centrality Measure in Social Networks

Processing large graphs is an emerging and increasingly important computation in a variety of application domains, from social networking to genomics and marketing. One of the important and computationally challenging structural graph metrics is node betweenness centrality, a measure of influence of a node in the graph. The best known algorithm for computing exact betweenness centrality runs in...

متن کامل

Approximating Betweenness Centrality in Large Evolving Networks

Betweenness centrality ranks the importance of nodes by their participation in all shortest paths of the network. Therefore computing exact betweenness values is impractical in large networks. For static networks, approximation based on randomly sampled paths has been shown to be significantly faster in practice. However, for dynamic networks, no approximation algorithm for betweenness centrali...

متن کامل

A Fast Approach to the Detection of All-Purpose Hubs in Complex Networks with Chemical Applications

A novel algorithm for the fast detection of hubs in chemical networks is presented. The algorithm identifies a set of nodes in the network as most significant, aimed to be the most effective points of distribution for fast, widespread coverage throughout the system. We show that our hubs have in general greater closeness centrality and betweenness centrality than vertices with maximal degree, w...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1704.07351  شماره 

صفحات  -

تاریخ انتشار 2017